Alex Merced's Data, Dev and AI Blog

Tag: Data Engineering

Batch vs. Streaming: Choose the Right Processing Model

2026-02-18

Conceptual, Logical, and Physical Data Models Explained

2026-02-18

Data Engineering Best Practices: The Complete Checklist

2026-02-18

Data Modeling Best Practices: 7 Mistakes to Avoid

2026-02-18

Data Modeling for Analytics: Optimize for Queries, Not Transactions

2026-02-18

Data Modeling for the Lakehouse: What Changes

2026-02-18

Data Quality Is a Pipeline Problem, Not a Dashboard Problem

2026-02-18

Data Vault Modeling: Hubs, Links, and Satellites

2026-02-18

Denormalization: When and Why to Flatten Your Data

2026-02-18

Dimensional Modeling: Facts, Dimensions, and Grains

2026-02-18

How to Design Reliable Data Pipelines

2026-02-18

How to Think Like a Data Engineer

2026-02-18

Idempotent Pipelines: Build Once, Run Safely Forever

2026-02-18

Partition and Organize Data for Performance

2026-02-18

Pipeline Observability: Know When Things Break

2026-02-18

Schema Evolution Without Breaking Consumers

2026-02-18

Slowly Changing Dimensions: Types 1-3 with Examples

2026-02-18

Star Schema vs. Snowflake Schema: When to Use Each

2026-02-18

Testing Data Pipelines: What to Validate and When

2026-02-18

What Is Data Modeling? A Complete Guide

2026-02-18

dremioframe & iceberg - Pythonic interfaces for Dremio and Apache Iceberg

2025-12-05

Introducing dremioframe - A Pythonic DataFrame Interface for Dremio

2025-11-29

Comprehensive Hands-on Walk Through of Dremio Cloud Next Gen (Hands-on with Free Trial)

2025-11-12

2025-2026 Guide to Learning about Apache Iceberg, Data Lakehouse & Agentic AI

2025-10-23

An Exploration of the Commercial Iceberg Catalog Ecosystem

2025-10-21

Building a Universal Lakehouse Catalog - Beyond Iceberg Tables

2025-10-17

Intro to Apache Iceberg with Apache Polaris and Apache Spark

2025-10-16

The State of Apache Iceberg v4 - October 2025 Edition

2025-10-14

The Ultimate Guide to Open Table Formats - Iceberg, Delta Lake, Hudi, Paimon, and DuckLake

2025-09-24

The 2025 & 2026 Ultimate Guide to the Data Lakehouse and the Data Lakehouse Ecosystem

2025-09-23

The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg

2025-09-16

Managing Large-Scale Optimizations — Parallelism, Checkpointing, and Fail Recovery

2025-09-09

Unlocking the Power of Agentic AI with Apache Iceberg and Dremio

2025-09-05

Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg

2025-09-02

Designing the Ideal Cadence for Compaction and Snapshot Expiration

2025-08-19

How to Discover or Organize Lakehouse & Apache Iceberg Meetups

2025-07-03

Introduction to Data Engineering Concepts | What is Data Engineering?

2025-05-02

Introduction to Data Engineering Concepts | Understanding Data Sources and Ingestion

2025-05-02

Introduction to Data Engineering Concepts | ETL vs ELT – Understanding Data Pipelines

2025-05-02

Introduction to Data Engineering Concepts | Batch Processing Fundamentals

2025-05-02

Introduction to Data Engineering Concepts | Streaming Data Fundamentals

2025-05-02

Introduction to Data Engineering Concepts | Data Modeling Basics

2025-05-02

Introduction to Data Engineering Concepts | Data Warehousing Fundamentals

2025-05-02

Introduction to Data Engineering Concepts | Data Lakes Explained

2025-05-02

Introduction to Data Engineering Concepts | Storage Formats and Compression

2025-05-02

Introduction to Data Engineering Concepts | Data Quality and Validation

2025-05-02

Introduction to Data Engineering Concepts | Metadata, Lineage, and Governance

2025-05-02

Introduction to Data Engineering Concepts | Scheduling and Workflow Orchestration

2025-05-02

Introduction to Data Engineering Concepts | Building Scalable Pipelines

2025-05-02

Introduction to Data Engineering Concepts | DevOps for Data Engineering

2025-05-02

Introduction to Data Engineering Concepts | Cloud Data Platforms and the Modern Stack

2025-05-02

Introduction to Data Engineering Concepts | Data Lakehouse Architecture Explained

2025-05-02

Introduction to Data Engineering Concepts | Apache Iceberg, Arrow, and Polaris

2025-05-02

Introduction to Data Engineering Concepts | The Power of Dremio in the Modern Lakehouse

2025-05-02

Video Course - Basics of Lakehouse Engineering - Apache Iceberg, Nessie, Dremio

2024-06-26

What is the Data Lakehouse and the Role of Apache Iceberg, Nessie and Dremio?

2024-02-21

No Code - Convert XLS/CSV files into Parquet with Dremio

2023-12-18

📬 Join the Mailing List

Get updates directly to your inbox.

Subscribe Now

Menu

Search